NBA: defensive distillation for backdoor removal via neural behavior alignment
نویسندگان
چکیده
Abstract Recently, deep neural networks have been shown to be vulnerable backdoor attacks. A is inserted into via this attack paradigm, thus compromising the integrity of network. As soon as an attacker presents a trigger during testing phase, in model activated, allowing network make specific wrong predictions. It extremely important defend against attacks since they are very stealthy and dangerous. In paper, we propose novel defense mechanism, Neural Behavioral Alignment (NBA), for removal. NBA optimizes distillation process terms knowledge form samples improve performance according characteristics defense. builds high-level representations behavior within order facilitate transfer knowledge. Additionally, crafts pseudo induce student models exhibit behavior. By aligning from with benign teacher network, enables proactive removal backdoors. Extensive experiments show that can effectively six different outperform five state-of-the-art defenses.
منابع مشابه
Extending Defensive Distillation
Machine learning is vulnerable to adversarial examples: inputs carefully modified to force misclassification. Designing defenses against such inputs remains largely an open problem. In this work, we revisit defensive distillation—which is one of the mechanisms proposed to mitigate adversarial examples—to address its limitations. We view our results not only as an effective way of addressing som...
متن کاملOn the Effectiveness of Defensive Distillation
We report experimental results indicating that defensive distillation successfully mitigates adversarial samples crafted using the fast gradient sign method [2], in addition to those crafted using the Jacobian-based iterative attack [5] on which the defense mechanism was originally evaluated.
متن کاملClassifying NBA Offensive Plays Using Neural Networks
The amount of raw information available for basketball analytics has been given a great boost with the availability of player tracking data. This facilitates detailed analyses of player movement patterns. In this paper, we focus on the difficult problem of offensive playcall classification. While outstanding individual players are crucial for the success of a team, the strategies that a team ca...
متن کاملImage alignment via kernelized feature learning
Machine learning is an application of artificial intelligence that is able to automatically learn and improve from experience without being explicitly programmed. The primary assumption for most of the machine learning algorithms is that the training set (source domain) and the test set (target domain) follow from the same probability distribution. However, in most of the real-world application...
متن کاملDefensive Distillation is Not Robust to Adversarial Examples
We show that defensive distillation is not secure: it is no more resistant to targeted misclassification attacks than unprotected neural networks.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Cybersecurity
سال: 2023
ISSN: ['2523-3246']
DOI: https://doi.org/10.1186/s42400-023-00154-z